Two-stage sampling for etiologic studies. Sample size and power.
نویسندگان
چکیده
Preexisting computerized databases are potentially valuable sources of epidemiologic data. Since such databases are infrequently created specifically for etiologic research, data may be available for the exposure of interest and, through record linkage, for the endpoint of interest, but lacking for potential confounders. Because of the size of these databases, two-stage sampling is an efficient alternative to surveying the entire study population for confounder data. At stage 1, information on exposure and disease status is obtained for the entire study population. Confounder data are collected for probability-selected subsamples at stage 2. Logistic regression is performed on the stage 2 samples, with the parameter estimates and variances appropriately corrected to account for the stage 1 data. In this paper, the authors present methods for determining the required stage 2 sample size in the case of categorical exposure and confounding variables. Sample size tables, power curves, and a computer program have been produced to accommodate a binary exposure and a single binary confounder. With the increasing availability of preexisting yet incomplete databases, the potential for use of two-stage sampling will greatly increase in the future. This investigation provides a basis for estimating the number of participants to sample for the collection of confounder data at the second stage.
منابع مشابه
Bayesian Sample size Determination for Longitudinal Studies with Continuous Response using Marginal Models
Introduction Longitudinal study designs are common in a lot of scientific researches, especially in medical, social and economic sciences. The reason is that longitudinal studies allow researchers to measure changes of each individual over time and often have higher statistical power than cross-sectional studies. Choosing an appropriate sample size is a crucial step in a successful study. A st...
متن کاملPower in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants.
Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined...
متن کاملDetermining the sample size required to compare vegetation and soil characteristics in two independent groups using effect size
Extended Abstract Background and objectives: One of the important steps in assessing rangeland vegetation is determining the sample size. Adequacy of sample size and its determination is always one of the main concerns of rangeland vegetation analyzer. There are two general methods for determining the sample size in rangeland science: graphic and statistical methods. In this study, the sample...
متن کاملBayesian Determination of Sample Size in Longitudinal Studies with Binary Responses Using Random Effects Models
Sample size determination is important in all statistical studies including longitudinal studies. This is usually done by considering a target power to reduce the costs of sampling. Choosing the right sample size using efficient methods, ensures that the researcher achieve goal of the study, by spending the least amount of energy, time and money. In this article, using a method based on simulat...
متن کاملDesign of Economic Optimal Double Sampling Design with Zero Acceptance Numbers
In zero acceptance number sampling plans, the sample items of an incoming lot are inspected one by one. The proposed method in this research follows these rules: if the number of nonconforming items in the first sample is equal to zero, the lot is accepted but if the number of nonconforming items is equal to one, then second sample is taken and the policy of zero acceptance number would be ap...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- American journal of epidemiology
دوره 146 5 شماره
صفحات -
تاریخ انتشار 1997